Spectral Clustering in Heterogeneous Networks
نویسندگان
چکیده
Many real-world systems consist of several types of entities, and heterogeneous networks are required to represent such systems. However, the current statistical toolbox for network data can only deal with homogeneous networks, where all nodes are supposed to be of the same type. This article introduces a statistical framework for community detection in heterogeneous networks. For modeling heterogeneous networks, we propose heterogeneous versions of both the classical stochastic blockmodel and the degree-corrected blockmodel. For community detection, we formulate heterogeneous versions of standard spectral clustering and regularized spectral clustering. We demonstrate the theoretical accuracy of the proposed heterogeneous methods for networks generated from the proposed heterogeneous models. Our simulations establish the superiority of proposed heterogeneous methods over existing homogeneous methods in finite networks generated from the models. An analysis of the DBLP four-area data demonstrates the improved accuracy of the heterogeneous method over the homogeneous method in identifying research areas for authors.
منابع مشابه
Measuring Domain Influence in Heterogeneous Networks
Influence can greatly benefit fields like viral marketing, information propagation and recommender systems, while the pervasiveness of heterogeneous networks, such as Twitter, provides richer information for influence research. However, current influence research focuses on analyzing general influence, which assumes various users have similar influence over the network, without fully exploiting...
متن کاملRegularized Spectral Clustering under the Degree-Corrected Stochastic Blockmodel
Spectral clustering is a fast and popular algorithm for finding clusters in networks. Recently, Chaudhuri et al. [1] and Amini et al. [2] proposed inspired variations on the algorithm that artificially inflate the node degrees for improved statistical performance. The current paper extends the previous statistical estimation results to the more canonical spectral clustering algorithm in a way t...
متن کاملMLCA: A Multi-Level Clustering Algorithm for Routing in Wireless Sensor Networks
Energy constraint is the biggest challenge in wireless sensor networks because the power supply of each sensor node is a battery that is not rechargeable or replaceable due to the applications of these networks. One of the successful methods for saving energy in these networks is clustering. It has caused that cluster-based routing algorithms are successful routing algorithm for these networks....
متن کاملDiffusion Model Based Spectral Clustering for Protein-Protein Interaction Networks
BACKGROUND A goal of systems biology is to analyze large-scale molecular networks including gene expressions and protein-protein interactions, revealing the relationships between network structures and their biological functions. Dividing a protein-protein interaction (PPI) network into naturally grouped parts is an essential way to investigate the relationship between topology of networks and ...
متن کاملPrediction of pore facies using GMDH-type neural networks: a case study from the South Pars gas field, Persian Gulf basin
The current study proposes a two-step approach for pore facies characterization in the carbonate reservoirs with an example from the Kangan and Dalanformations in the South Pars gas field. In the first step, pore facies were determined based on Mercury Injection Capillary Pressure (MICP) data incorporation with the Hierarchical Clustering Analysis (HCA) method. In the next step, polynomial meta...
متن کامل